System for non-planar specular reflections in hybrid ray tracing

ABSTRACT

The present invention teaches a real-time hybrid ray tracing system for non-planar specular reflections. The high complexity of a non-planar surface is reduced to low complexity of multiple small planar surfaces. Advantage is taken of the planar nature of triangles that comprise building blocks of a non-planar surface. All secondary rays bouncing from a given surface triangle toward object triangles keep a close direction to each other. A collective control of secondary rays is enabled by this closeness and by decoupling secondary rays from primary rays. The result is a high coherence of secondary rays.

CROSS-REFERENCE TO RELATED CASES

The present application is a continuation of U.S. application Ser. No. 16/909,063 filed Jun. 23, 2020, entitled “Specular Reflections In Hybrid Ray Tracing”, which is a Continuation of the U.S. application Ser. No. 16/662,657 filed Oct. 24, 2019, entitled “Method for Non-Planar Specular Reflections in Hybrid Ray Tracing” (now U.S. Pat. No. 10,699,468), which claims priority based on U.S. Provisional Application No. 62/755,548 filed Nov. 4, 2018, entitled “Method and System for Non-planar reflections in Path Tracing”; and claims priority based on U.S. Provisional Application No. 62/755,568 filed Nov. 5, 2018, entitled “Method and System for Decoupling in Path Tracing”; and is a Continuation-In-Part of the U.S. application Ser. No. 16/004,348 filed Sep. 6, 2018, entitled “Fast path traced reflections for augmented reality”; and is a Continuation-In-Part of the U.S. application Ser. No. 16/444,464 filed Jun. 18, 2019, entitled “Method for Fast Generation of Path Traced Reflections on a Semi-Reflecting Surface”; and is a Continuation-In-Part of the U.S. application Ser. No. 16/444,431 filed Jun. 18, 2019, entitled “Path Tracing System Employing Distributed Accelerating Structures”; all of which are hereby incorporated by reference.

FIELD OF THE INVENTION

The present invention relates to new and improved way of carrying out ray tracing method at a reduced computational complexity.

BACKGROUND OF THE INVENTION

Ray tracing is a technique for computing visibility between points. Light transport algorithms simulate the way light-rays propagate through space (while interacting with objects), attaining the resulting colors for the screen pixels. Primary rays are shot from the screen to the scene intersecting with scene objects and generating secondary rays that bounce all over the scene according to the physical laws of optics. (Note: unless specifically stated otherwise, the general term secondary rays used herein throughout the specification discussions, refers to secondary, ternary, and higher degree rays).

Ray tracing is capable of producing a very high degree of visual realism, higher than that of typical raster methods, but at a greater computational cost. Ray tracing is superior to raster graphics by its capability to simulate a wide variety of optical effects, such as glossiness, specularity, radiosity, reflection and refraction, scattering, soft shadows and more. The prior art ray tracing is one of the most computationally complex applications. As such, it is best suited for applications where the image can be rendered slowly ahead of time, such as in still images and film and television visual effects, and is poorly suited for real-time animated application of augmented reality where the real time animation is critical.

Path tracing. The traditional ray tracing is not necessarily photorealistic. True photorealism occurs when the rendering equation is closely approximated or fully implemented. Implementing the rendering equation gives true photorealism, as the equation describes every physical effect of light flow. However, this depends on the available computing resources. Path tracing, referred to as a Monte Carlo ray tracing gives far more accurate simulation of real-world lighting. Traditional path tracers [Kajiya, J. T. 1986. The rendering equation. In Proc. SIGGRAPH] shoot rays through each pixel, stochastically scattering according to the profile of the intersected object's reflectance and continuing recursively until striking a light source. Repeated sampling for any given pixel in the image space will eventually cause the average of the samples to converge to the correct solution of a rendering equation, making it one of the most physically accurate 3D graphic rendering methods in existence. Path tracing can generate images that are indistinguishable from photographs. Its visual quality is higher than that of a simple ray tracing, but at a much greater computational cost. (Note: unless specifically stated otherwise, the general term ray tracing, used herein throughout the specification discussions, refers to path tracing.)

A path tracer continuously samples pixels of the screen space. Rays are distributed randomly within each pixel in screen space and at each intersection with an object in the scene a new reflection ray, pointing in a random direction, is generated. After some number of bounces each ray eventually exits the scene or is absorbed.

Path tracing is a global illumination technique. Global illumination takes into account not only the light that comes directly from a light source, but also light reflected by surfaces in the scene, whether specular, diffuse, or semi-reflective. FIG. 1 depicts the sampling of diffuse inter-reflection from the surrounding environment, at a given surface point. In order to achieve global illumination on a diffuse surface, sampling rays must be shot from a hit point (HIP) 10. HIP is a result of a previous encounter between a ray (primary or secondary) and a triangle. The sampling at the HIP is done by shooting many rays, each in a random direction, within the boundaries of a hemisphere 11. The hemisphere is oriented such that its north pole is aligned with the surface normal N.

The relation between the deviation of a sampling ray from the normal N in the hemisphere, and its contribution to the aggregated light energy, is shown in FIG. 2. It is strongly connected with the BRDF (bidirectional reflectance distribution function) of the surface material. BRDF is a function of four real variables that defines how light is reflected at an opaque surface. According to the Monte Carlo technique each of the hemisphere rays is shot from the same HIP but at random direction, bounded by the hemisphere of FIG. 1. As a result, ray's sampled data contributes to the aggregated light energy at the HIP, according to the BRDF function.

Hybrid ray tracing (ray tracing interlaced with raster rendering) is a deferred rendering process based on raster rendering to calculate the primary rays collision, while the secondary rays use a ray tracing approach to obtain shadow, reflection and refraction effects. This approach vastly improves ray tracing performance, not only because many unnecessary traditional ray tracing tasks are avoided, but also because a complete image is available in a demanded time, even if there is not enough time to finish calculations of all the visual effects.

The concept of a hybrid Real-Time Raster and Ray Tracer renderer is not new. Beck et al [Beck et al [Beck, S., c. Bernstein, A., Danch, D., Frohlich, B.: CPU-GPU hybrid real time ray tracing framework (2005)] proposes a CPU-GPU Real-Time Ray-Tracing Framework. Beck proposal spread the traditional stages of ray tracing in independent tasks for the GPU and CPU. These render tasks can be summarized into three GPU render passes: a shadow map generation pass, a geometry identification pass and a blur pass.

Bikker [Bikker, J.: Real-time ray tracing through the eyes of a game developer. In: Proceedings of the 2007 IEEE Symposium on Interactive Ray Tracing, Washington, D.C., USA, IEEE Computer Society (2007)] developed a Real-Time Path Tracer called Brigade, which divides the rendering task seamlessly over both GPU and CPU available cores. Brigade aims the production of proof-of-concept games that use path tracing as the primary rendering algorithm.

Pawel Bak [Bak, P.: Real time ray tracing. Master's thesis, IMM, DTU (2010)] implements a Real-Time Ray Tracer using DirectX 11 and HLSL. Similar to Beck's work, his approach also uses rasterization in order to achieve the best possible performance for primary hits.

Chen [Chen, C. C., Liu, D. S. M.: Use of hardware z-buffered rasterization to accelerate ray tracing. In: Proceedings of the 2007 ACM symposium on Applied computing. SAC '07, New York, N.Y., USA, ACM (2007) 1046-1050] presented a hybrid GPU/CPU ray tracer renderer, where a Zbuffered rasterization is performed to determine the visible triangles at the same time that primary ray intersections are determined. The CPU reads the data back in order to trace secondary rays.

Sabino et al. [Thaler Sabino, Paulo Andrade, Esteban Gonzales Clua, Anselmo Montenegro, Paulo Pagliosa, A Hybrid GPU Rasterized and Ray Traced Rendering Pipeline for Real Time Rendering of Per Pixel Effects, Univ. Federal Fluminense, Rio de Janeiro, Brazil, 2013] present a heuristic approach that select a subset of relevant objects to be ray traced, avoiding traversing rays for objects that might not have a significant contribution to the real time experience.

An important strategy in real-time hybrid ray tracing, is the use of GPU for raster techniques to improve the performance and a smart strategy for prioritizing regions and objects that will receive the ray tracing light effects. NVIDIA's OptiX [Parker, S. G., Bigler, J., Dietrich, A., Friedrich, H., Hoberock, J., Luebke, D., McAllister, D., McGuire, M., Morley, K., Robison, A., Stich, M.: Optix: A general purpose ray tracing engine. ACM Transactions on Graphics (August 2010)] is a general-purpose ray tracing engine targeting both NVIDIA's GPUs and general-purpose hardware in the current version. OptiX architecture offers a low-level ray tracing engine, a programmable ray tracing pipeline with a shader language based on CUDA C/C++, a domain-specific compiler and a scene-graph based representation. OptiX is a GPU only solution with remarkably good results for interactive ray tracing. Recently Nvidia OptiX has got a support by Nvidia RTX, which is a development platform for hybrid ray tracing, for a special purpose hardware. It runs on Nvidia Volta- and Turing-based GPUs, specifically utilizing an architecture for ray tracing acceleration.

Despite all the hybrid ray tracing developments, hybrid real-time ray tracers on low-power devices do not exist in prior art. Their applicability on low-power devices, such as laptops, tablets, hand-held mobiles, becomes more and more relevant. The likelihood of running ray tracing on low power devices is forecasted only to the thirties: “By Moore's law alone by 2032 we could be running real time ray tracing on mobile phones.” Jon Peddie, “Peddie predicts we could have real time ray tracing on our PCs in less than 6 years”, TechWatch, 27 Mar. 2018.

The hurdles of path tracing that prevent a real-time performance in prior art, either of a full-blown ray tracing or of a hybrid ray tracing are: traversal and frequent reconstruction of accelerating structures, lack of coherence of secondary rays, and a noise causing a “film-grain” appearance of images.

Real-time Ray Tracing (RTRT). Historically, ray tracing had been reserved to off-line applications, such as computer-generated photo-realistic animated films. Real-time applications of video games, virtual and augmented reality had have to rely on rasterization for their rendering. RTRT is a hard-computational task, not only because each pixel in the image must be calculated separately, but also because the final color of a single pixel can be affected by more than one recursive ray. Another consideration is that ray tracing algorithms waste from 75% to 95% of its execution time calculating intersection points between rays and objects. RTRT has been enabled by Nvidia's RTX in 2018 (Alwani, Rishi. “Microsoft and Nvidia Tech to Bring Photorealistic Games With Ray Tracing”, Gadgets 360. Retrieved Mar. 21, 2018), facilitating a new development in computer graphics of generating interactive images that react to lighting, shadows, reflections by special purpose hardware. Nvidia's RTX is based on traditional ray tracing algorithm accelerated by an on-chip supercomputing hardware of closely 5000 cores. It comprises of a GPU having 4352 cores, Al denoiser utilizing 544 cores, and intersection tests accelerator of 68 cores. The power requirement of a single RTX2080 GPU is 250 W, and the price starts at €418. Due to the high cost and high power of RTX it is targeted at the high-end video games. However, there is a great need to enable real-time ray tracing on consumer class devices (smartphones, tablets, laptops and PC), for video games, VR and AR, democratizing ray tracing to the massive audience which includes limited powered devices. For this end a new and novel method, based on radical algorithmic improvements is needed.

Reflections. In prior art's hybrid ray tracing the reflections are generated based on G-buffer (Luis Sabino et al., A Hybrid GPU Rasterized and Ray Traced Rendering Pipeline for Real Time Rendering of Per Pixel Effects, 2013). The G-Buffer is generated during the first stage by raster rendering, a “differed shading” stage. The basic idea behind deferred shading is to perform all visibility tests before performing any lighting computations. Therefore, at first, visibility tests are done by raster rendering, while shading is differed to a later stage, combined with ray tracing. The G-buffer produced by the deferred shading stage contains information about optical properties of the underlying material of each pixel. Its contents are used to determine the need for tracing reflection/refraction rays. It is composed by reflectivity, index of refraction, specular exponent and opacity, respectively. The rays need to be traced from the surfaces only through the scene. This way enables to avoid trace of unnecessary rays in places where the material is neither refractive nor reflective. After differed shading is done, the ray tracing algorithm starts with secondary rays and can follow its own path. Any secondary ray generated will be traced against scene in order to produce global illumination effects, such as reflections and refractions. The result of this stage can be understood as the generation of a ray trace effects layer. This effects layer will be blended to the image already generated, in order to improve its visual quality with global illumination effects.

According to the G-buffer method the secondary rays are a natural extension of primary rays. Ray tracing that is carried-on by the chosen secondary rays suffer from the same difficulties of conventional ray tracing: lack of coherence of secondary rays and images with stochastic noise.

Lack of ray coherence of secondary rays. Coherence of rays is the key to efficient parallelization of ray tracing. In prior art ray tracing the primary and shadow rays are coherent. This coherence is exploited for efficient parallel processing: traversing, intersecting, and shading by packets of coherent rays. They work well for nearby primary rays, since these rays often traverse similar parts of the accelerating data structure. Using this approach, we can reduce the compute time by using SIMD instructions on multiple rays in parallel, reduce memory bandwidth by requesting data only once per packet, and increase cache utilization at the same time. This works fine for primary rays that originate from the camera. Unfortunately, it is not possible to use ray packets effectively with rays of an advanced order (secondary, ternary, etc.). The primary reason is that advanced order rays bounce in different direction losing coherence. Moreover, in path tracing there is an intentional randomization of rays for diffuse reflections. Reorganizing secondary rays to form bundles with higher coherence ratios, are practiced by the prior art. But this kind of regrouping is a quite expensive operation since it involves a scatter/gather step, which may result in only a slight frame rate improvement when reordering is applied.

Sadeghi et al. [Iman Sadeghi, Bin Chen, and Henrik Wann, Coherent Path Tracing, Jensen University of California, San Diego, 2009], developed a technique for improving the coherency of secondary rays. This technique uses the same sequence of random numbers for generating secondary rays for all the pixels in each sample. This improves the efficiency of the packet tracing algorithm but creates structured noise patterns in the image.

Noisy images. A path tracer continuously samples pixels of the screen space. The image starts to become recognizable after only a multiple samples per pixel. Rays are distributed randomly within each pixel in screen space and at each intersection with an object in the scene a new reflection ray, pointing in a random direction, is generated. After some number of bounces each ray eventually exits the scene or is absorbed. When a ray has finished bouncing about in the scene a sample value is calculated based on the objects the ray bounced against. The sample value is added to the average for the source pixel. The random components in ray tracing cause the rendered image to appear noisy. The noise decreases over time as more and more samples are calculated. The defining factor for render quality is the number of samples per pixel (SPP). The higher SPP you have in a rendered image the less noise will be noticeable. However, the added quality per sample decreases the more samples you have already (since each sample is just contributing to an average over all samples). The difference in image quality between, for example, 20,000 SSP and 21,000 SSP will not be as noticeable as between 1,000 SSP and 2,000 SSP. The initial high screen resolution transforms to a low spatial resolution that decreases quickly as the rays are progressing deeper into the space. Due to the low spatial resolution each one of the produced frames is noisy. Only converge of many subsequent frames reduces the final image noise. The image to converge and reduce noise to acceptable levels usually takes around 5000 samples for most path traced images, and many more for pathological cases. Noise is particularly a problem for animations, giving them a normally unwanted “film-grain” quality of random speckling.

Accelerating structures. The most time-consuming tasks in ray tracing are intersection tests between millions of rays and millions of polygons. They are partly relieved by use of acceleration structures (AS) which are huge binary trees. Every single ray is traversed across an accelerating structure (e.g. K-trees or BVH trees), seeking polygons for intersection. These traversals become a major time-consuming task—they typically take over 70% of the image generation time. The polygons near to the path of the traversing ray are subject to intersection test.

The AS based on binary trees are basically static. They are constructed in a preprocessing step. Such a step takes typically much more time than rendering an image. The construction time depends on the scene size. Every major modification of the scene necessitates a reconstruction of the static acceleration structures. Moreover, the memory size is typically doubled by these structures.

There are two major drawbacks associated with the use of static acceleration structures; (i) traversals of these structures are time-consuming, challenging the real-time requirements, and (ii) they must be repeatedly reconstructed upon scene changes, which contradicts with real time animation. Reconstructing static acceleration structure is a computationally intensive task, preventing real-time animation.

SUMMARY OF THE PRESENT INVENTION

The present invention teaches a real-time hybrid ray tracing method for non-planar specular reflections.

The prior art hybrid ray tracing inherits the main hurdles of the classic ray tracing: incoherence of secondary rays, noisy images and lack of real-time animation.

In the hybrid ray tracing of present invention, these hurdles of prior art have been eliminated by: (i) high coherence of secondary rays, and (ii) dynamic acceleration structures that replace the prior art's static ones.

According to the basic principle of ray tracing the reflection effects are modelled by secondary and higher order rays. In prior art, according to the G-buffer method of hybrid ray tracing, secondary rays are a natural extension of primary rays. Secondary rays, that survived the reduction of unnecessary rays in places where the material is neither refractive nor reflective, follow their own path against the scene. The paths from non-planar surfaces are much more dispersed than from planar ones. As such, there is no way to control these rays for coherence or reduced noise.

The present invention deals only with reflections from non-planar objects. The planar reflection has been described in a previous patent application U.S. Ser. No. 16/004,348 Fast Path Traced Reflections for Augmented Reality, to which the present invention claims priority. The bouncing direction of secondary rays is deterministic for perfectly specular (reflective) surfaces, or random (see FIG. 1) for diffuse (glossy) surfaces. The current invention deals with specular (reflective) non-planar reflections.

According to the present invention, the high complexity of non-planar surface is reduced to low complexity of multiple small planar surfaces. Advantage is taken of the planar nature of triangles that comprise building blocks of a non-planar surface. All secondary rays bouncing from a given triangle toward object triangles keep a close direction to each other. A collective control of secondary rays is enabled by this closeness and by decoupling secondary rays from primary rays. The result is high coherence of secondary rays.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention is herein described, by way of non-limiting examples only, with reference to the accompanying figures and drawings, wherein like designations denote like elements. Understanding that these drawings only provide information concerning typical embodiments of the invention and are not therefore to be considered limiting in scope:

FIG. 1. Prior art. Hemisphere of diffuse reflection used in path tracing.

FIG. 2. Prior art. Statistical distribution on a unit hemisphere.

FIG. 3. The principle of the method, including the surface and object triangles, primary and secondary rays.

FIG. 4. HIPs covering surface triangle.

FIG. 5. Primary and secondary rays relating to surface triangle.

FIG. 6. Matching secondary rays to primary HIPs.

FIG. 7. Non-planar setup of surface triangle with single primary and single secondary ray.

FIG. 8. Non-planar setup of surface triangle with beam of primary and beam of secondary rays.

FIG. 9. Point of visibility from surface triangle.

FIG. 10. Another point of visibility from surface triangle.

FIG. 11. Reflection of object triangle in surface triangle.

FIG. 12. Clipped reflection of object triangle in surface triangle.

FIG. 13a . Creation of secondary ray, early segment.

FIG. 13b . Creation of secondary ray, main segment.

FIG. 14. Flowchart of embodiment of present invention.

FIG. 15. An example of reflected object in specular non-planar surface.

FIG. 16. Prior art—GPU graphics pipeline

DETAILED DESCRIPTION OF THE PRESENT INVENTION

The principles and operation of an apparatus according to the present invention may be understood with reference to the figures and the accompanying description wherein similar components appearing in different figures are denoted by identical reference numerals. The drawings and descriptions are conceptual only. In actual practice, a single component can implement one or more functions; alternatively, each function can be implemented by a plurality of components and devices. It will be readily understood that the components of the present invention, as generally described and illustrated in the figures herein, could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the apparatus, system, and method of the present invention, as represented in the figures herein, is not intended to limit the scope of the invention, as claimed, but is merely representative of embodiments of the invention.

Unless specifically stated otherwise, as apparent from the following discussions, it is appreciated that throughout the specification discussions, utilizing terms such as “processing”, “computing”, “calculating”, “determining”, “generating”, “creating” or the like, refer to the action and/or processes of a computer or computing system, or processor or similar electronic computing device, that manipulate and/or transform data represented as physical, such as electronic, quantities within the computing system's registers and/or memories into other data, similarly represented as physical quantities within the computing system's memories, registers or other such information storage, transmission or display devices.

Embodiments of the present invention may use terms such as processor, computer, apparatus, system, sub-system, module, unit, and device (in single or plural form) for performing the operations herein. This may be specially constructed for the desired purposes, or it may contain a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Several technical terms specifically associated with our disclosure are herein defined.

Computer graphics pipeline refers to the most common form of computer 3D rendering, 3D polygon rendering, distinct from raytracing, and ray casting. In particular, in ray casting, a ray originates at the point where the camera resides, if that ray hits a surface, then the color and lighting of the point on the surface where the ray hit is calculated. In 3D polygon rendering the reverse happens, the area that is in view of the camera is calculated, and then rays are created from every part of every surface in view of the camera and traced back to the camera. The graphics pipeline is usually used in real-time rendering.

Rendering a projection is a 3D computer graphics process of automatically converting 3D wire frame models into 2D images rendering on a computer. The projection can be of a perspective, parallel, inverse or of another shape.

Render target is a feature of modern graphics processing units (GPUs) that allows a 3D scene to be rendered to an intermediate memory buffer, or Render Target Texture (RTT), instead of the frame buffer or back buffer. This RTT can then be manipulated by pixel shaders in order to make searches or apply effects to the final image.

Primary rays, are the first generation of rays in ray tracing, cast into the scene from camera or from eye to solve for visibility, i.e. to find whether the primary ray intersects a surface.

Secondary rays in ray tracing are spawned from primary rays at ray-polygon intersection points. They are used to compute things like shadows, reflections, refractions, etc. Herein, we use the term collectively for all successive generations as well, such as for ternary rays, fourth generation, etc.

Global illumination, is a general name of a group of algorithms used in 3D computer graphics that are meant to add more realistic lighting to 3D scenes, taking into account not only the light that comes directly from a light source (direct illumination), but also subsequent cases in which light rays from the same source are reflected by other surfaces in the scene, whether reflective or not (indirect Illumination).

Color bleeding in computer graphics is the phenomenon in which objects or surfaces are colored by reflection of indirect light from nearby surfaces. This is a visible effect that appears when a scene is rendered with full global illumination.

Accelerating structures, such as grids, octrees, binary space partitioning trees (BSP trees), kd-trees and BVHs (bounding volume hierarchy), are used in ray tracing to solve for visibility, allowing improvement of render times in speed and efficiency, as compared to naïve ray tracing without accelerating structures.

Object can stand for a primitive (polygon, triangle, solid, etc.), or a complex object made out of primitives.

Hit point is a point where a ray intersects an object. Termed also HIP.

Visibility—given a set of obstacles in the Euclidean space, two points in the space are said to be visible to each other, if the line segment that joins them does not intersect any obstacles.

Scene, a collection of 3D models and light sources in world space, into which a camera may be placed, describing a scene for 3D rendering. Scene model elements include geometric primitives: points or vertices; line segments or edges; polygons or faces.

Clipping, in the context of computer graphics, is a method to selectively enable or disable rendering operations within a defined region of interest.

Surface triangle (termed also receiving triangle)—in Computer Graphics triangles are the primitive elements of any 3D object. In the present invention, for short, surface triangles are used for the primitives of a reflective surface.

Object triangle—in Computer Graphics triangles are the primitive elements of any 3D object. In the present invention, for short, object triangles are used for the primitives of an object that is processed for reflection in a reflective surface.

SIMD—(single instruction multiple data) is a class of parallel computers in Flynn's taxonomy. It describes computers with multiple processing elements that perform the same operation on multiple data points simultaneously. SIMD is particularly applicable to common tasks of Computer Graphics. Modern GPU designs include SIMD instructions to improve the performance.

Specular reflection—is the mirror-like reflection of light from a surface. In this process, each incident ray is reflected at the same angle to the surface normal as the incident ray, but on the opposing side of the surface normal in the plane formed by incident rays. The result is that an image reflected by the surface is reproduced in mirror-like (specular) fashion. Throughout the present invention a specular surface is also termed reflective surface.

Diffuse reflection—is the reflection of light from a surface such that a ray incident on the surface is scattered at many angles rather than at just one angle as in the case of specular reflection. Termed also Glossy throughout the present invention.

Phantom camera—as opposed to the real camera from which the 3D scene is viewed. Phantom camera is a reflected location of the real camera in a specific virtual plane. It is used in present invention to generate an alternative reflection.

The processes/devices and displays presented herein are not inherently related to any particular computer or other apparatus, unless specifically stated otherwise. Various general-purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct a more specialized apparatus to perform the desired method. The desired structure for a variety of these systems will appear in the description below. In addition, embodiments of the present invention are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages, graphics APIs, or operating systems may be used to implement the teachings of the inventions as described herein.

In hybrid ray tracing global illumination effects, such as reflections, are produced by secondary rays. The result of secondary stage can be understood as generation of a ray traced reflection layer by secondary rays. This layer is blended to the image already generated by raster projection of primary rays, in order to improve its visual quality with a global illumination effect.

In prior art once primary rays are shot at the scene, secondary rays need to be traced continuing the path from the primary hit points through the scene in the direction of light sources, generating reflections. The parallel processing of secondary rays is limited due to incoherence of secondary rays, particularly in the case of non-planar surfaces.

Due to high coherence of secondary rays, the present invention enables a highly parallel processing of ray tracing. High coherence is gained by handling the secondary rays collectively and decoupling them from primary rays. As a result, the secondary rays are coherent, well suited for SIMD mechanisms. Furthermore, due to coherence of secondary rays and initial scene covering by primary rays, the stochastic noise is eliminated.

During the scene-design the surfaces that are candidates to reflect, either specular or diffuse, are selected as receiving (reflective) surfaces on which reflections are generated. These receiving surfaces might be planar (U.S. Ser. No. 16/004,348) or non-planar (current invention).

The principle of the method of present invention is illustrated in FIG. 3. A non-planar receiving surface 308 as well as the reflected object 303 consist of triangles and are handled as triangles. Each receiving triangle of the non-planar surface, such as 301, is handled separately. Triangle 303 represents an object to be reflected in the receiving triangle 301. During the initial phase, primary rays (between rays 311-312) hit the receiving triangle, creating multiple primary hit points, e.g. 41 and 42 in FIG. 4. The secondary rays are created collectively as an independent beam (between rays 309-310), targeted at the object 303. At the first stage they are decoupled from the primary hit points and treated collectively.

This opposes the prior art where HIPs are taken as an explicit start-point for secondary rays, such that each newly created secondary ray becomes an unambiguous continuation of its originator, primary ray. Moreover, each continuing secondary ray is handled individually, following its own path against the scene. In non-planar surfaces these paths are highly dispersed, hence incoherence of secondary rays.

Note: secondary is used in its broad meaning: between secondary and primary rays, or between ternary and secondary, etc. For the sake of simplicity, in the upcoming text we denote secondary and primary rays, but the same description can be applied to other couples of ray generations as well.

According to present invention, bundles of secondary rays are generated and handled separately for each of the surface triangles, taking advantage of their planar nature. Since all secondary rays bouncing from a given triangle keep a close direction, they are handled collectively as part of a common projection. But during the final collecting stage each of the secondary rays is matched to one of the primary rays, for delivering the collected light values to image pixels. An embodiment of secondary-to-primary matching of present invention is illustrated in FIGS. 5-6.

In FIG. 5 a beam 50 of secondary rays shot from receiving triangle toward a reflected object is shown. The secondary rays may outnumber the HIPs of primary rays. The way the secondary rays, after collecting the light values from the reflected object, are matched to primary rays is shown in FIG. 6. Two HIPs are shown, 40 and 41. The secondary rays are matched with primary rays by their nearness; rays in the bundle 61 are matched to 40, while 62 are matched to 41.

In FIG. 7, a single primary ray 71 is shot at a receiving triangle 76, hitting it at HIP 77. Then secondary ray 74, which is part of the beam of secondary rays 75, is in a proximity to primary HIP 77, intersected by the primary ray 71. These two rays are matched sharing their collected light values. The shown HIP's hemisphere 73 is in use for diffuse (glossy) non-planar surfaces.

It should be noted that the number of secondary rays in the segment may be arbitrary, according to the needs. The more rays, the higher the spatial resolution.

While FIG. 7 refers only to a single primary ray and single secondary ray that can be matched around a single point on the receiving triangle 76, FIG. 8 refers to all primary rays and all secondary rays on the same triangle setup. A beam of primary rays bounded by 82-83 is shot to the receiving triangle 76, and a beam of secondary rays is generated from the receiving triangle to the object triangle 78.

For diffuse (glossy) reflection, for each secondary ray originating at surface 79, its own hemisphere 73 is used. There is room for multiple secondary rays in the hemisphere, each of its own visibility of the object triangle 78, according to ray's direction. The entire range of visibility of the object triangle from the hemisphere of ray 74 is shown in FIG. 9. The hemisphere section 91 is the loci of many possible secondary rays shot from the same origin on the surface 92 in varying directions, all having visibility of the object-triangle 78. Each ray in the segment can be weighted differently according to its importance (distance from the normal N on the hemisphere), BRDF of the material and a statistical distribution on a unit hemisphere (FIG. 2).

For comparison, FIG. 10 shows another secondary ray 75, shot from another point of the same surface 92, having a different hemisphere section of visibility. The use of hemisphere of FIG. 1 applies to diffuse (glossy) reflections only. For specular reflections a single deterministic secondary ray from each point at the receiving triangle is shot, and the use of hemisphere does not apply.

Calculating specular reflection of object 111 on receiving triangle 110 is illustrated in FIG. 11. A single deterministic secondary ray from each point of the receiving triangle is shot, comprising a ray belonging to the perspective projection 113-114. A weighted normal N of the receiving triangle 110 is calculated by interpolating the receiving-triangle's normals at each of its vertices. A virtual plane 116 perpendicular to the weighted normal N is calculated, and a location of a phantom camera is defined 115, to reflect the original camera 117 at the virtual plane 116. This phantom camera is the origin of the projection related to the receiving-triangle 110. Such a projection is shown 112 bounded by the rays 113, 114. The object triangle 111, as it is visible from the receiving triangle 110 is projected on the triangle's face, when its vertices define the reflected image 112 of the object. The object triangle must be transformed to the surface of the receiving triangle. The most efficient way to perform such transformation is by the vertex shader of a GPU graphics pipeline.

A slightly different case of a specular reflection, when the reflected triangle does not entirely fall within the receiver-triangle, is illustrated in FIG. 12. In such a case the reflected object 122 must undergo a clipping according to the receiver's boundaries, as shown.

In order to reach the reflected object from the receiving triangle by a regular raster projection mechanism, the projection from the phantom camera toward the receiving triangle and farther to the reflected object is done in two steps. In the first step a relative depth map of the receiving triangle from the phantom camera is generated, for filtering out any object that might befall between them two. This depth map is utilized as a guiding mask to enable concentrating on the main segment of the secondary ray 1312, starting at point 1311 and excluding the early segment, as shown in FIG. 13a . As shown in FIG. 13b , the main segment of the secondary ray intersects the object triangle 1322 at 1321.

The method is summarized in the flowchart of FIG. 14. As a preprocessing step the scene designer selects in the scene non-planar surfaces as candidates for specular reflectivity and per each surface selects objects to be reflected in the surface. In run time, per each selected surface, primary rays are shot to cover the surface with primary HIPs 1401. These primary HIPs do not serve as direct bouncing points for secondary rays, they are matched in a final stage. The non-planar receiving surface is treated by its triangles, which are planar entities. Per each surface triangle 1402 a location of phantom camera is calculated 1403, projection is done from the phantom camera, going through the surface triangle and intersecting with secondary rays the object triangles. All object triangles are processed one by one for each surface triangle 1404. Visibility is tested for each object triangle by ray/polygon intersection tests 1405. There are three optional test results 1406: (i) not visible, then the test passes to the next object triangle, (ii) partly visible, then clipping of the object triangle to the surface triangle must be performed 1407, and (iii) fully visible. The last two options are followed by geometric transformation of the object triangle to the receiving triangle (and clipping to the receiving triangle if required), and shading 1408. Light values are sampled at the intersection points between secondary rays and the object triangle 1409. When current surface triangle is done with all object triangles 1411, next surface triangle is taken. When all surface triangles are done 1413, all the secondary rays are matched with primary HIPs 1414, the light values of the primary and secondary rays are composited 1415, and the composited results are transferred to the respective image pixel 1416.

FIG. 15 depicts an example of reflected object on a specular non-planar surface. The reflective surface of the kettle 150 consists of mesh of polygons (mesh not shown) on which an object 151 is reflected. The reflected object is treated as a mesh of triangles.

The method of the present invention is based on multiple projections, geometric transformations, texture wrapping and general computing. Although 3D rendering can be accomplished by CPU, but the most efficient way for above tasks is by graphics pipeline of GPU. FIG. 16 shows a prior art graphics pipeline. The GPU graphics pipeline executes the most common form of 3D rendering, as required by the present invention. The above tasks are performed by vertex, geometry, computing and fragment shaders. Moreover, as the present invention eliminates the incoherence of secondary rays, using the GPUs is the best alternative due to its parallel nature by its concept. For example, the most recent Nvidia RTX2080 GPU comprises 4352 processing cores (unified shaders). Their supercomputing level of processing can be exploited only by utmost parallelism which is enabled by present invention's coherence of secondary rays. Thus, GPU is well suited for the required tasks of present invention. 

The invention claimed is:
 1. Computer system for hybrid ray tracing generating reflections of objects in a non-planar specular surface, utilizing a graphics processing unit, the system comprising: at least one general purpose processor, at least one graphics processing unit, computer memory, and geometric data of scene objects, wherein in runtime, a) for each triangle of a non-planar specular surface rays are generated such that they are first projected to that triangle and further, as secondary rays, to the objects' triangles; b) to each surface's triangle all its visible objects' triangles or part thereof are transformed and shaded; c) per all the secondary rays of each triangle of the non-planar specular surface, light values of the transformed and shaded objects' triangles are sampled; d) primary rays are projected on the non-planar specular surface; e) per each triangle of the non-planar specular surface, matching the secondary rays and the primary rays; f) the light values of the matched primary and secondary rays are composited; and g) all the composited light values are transferred to their respective image pixels.
 2. The system of claim 1 wherein the secondary rays bouncing from surface triangles of the non-planar specular surface are decoupled from primary rays.
 3. The system of claim 1 wherein each triangle of the non-planar specular surface is treated separately as a planar surface.
 4. The system of claim 1 wherein secondary rays shot from each triangle of the non-planar specular surface are treated collectively.
 5. The system of claim 1 wherein the visibility of object triangles or part thereof is tested by intersecting secondary rays with object triangles.
 6. The system of claim 1 wherein non-visible parts of object triangles are clipped out.
 7. The system of claim 1 wherein light values are sampled at the intersection points between secondary rays and object triangles.
 8. The system of claim 1 wherein secondary rays are matched with primary rays by means of hit points created by primary rays on the non-planar specular surface.
 9. The system of claim 1 wherein light values contributed by secondary rays create a ray traced reflection layer.
 10. The system of claim 9 wherein the ray traced reflection layer is composited with light values of a raster projection of primary rays, for improved visual quality of generated image.
 11. A method comprising: for each triangle of a non-planar specular surface, generating rays that are first projected to that triangle and further, as secondary rays, to the objects' triangles; to each surface's triangle; transforming and shading all its visible objects' triangles or part thereof; per all the secondary rays of each triangle of the non-planar specular surface, sampling light values of the transformed and shaded objects' triangles; projecting primary rays on the non-planar specular surface; per each triangle of the non-planar specular surface, matching the secondary rays and the primary rays; compositing the light values of the matched primary and secondary rays; and transferring all the composited light values to their respective image pixels.
 12. The method of claim 11, further comprising decoupling from primary rays the secondary rays bouncing from surface triangles of the non-planar specular surface.
 13. The method of claim 11 wherein each triangle of the non-planar specular surface is treated separately as a planar surface.
 14. The method of claim 11 wherein secondary rays shot from each triangle of the non-planar specular surface are treated collectively.
 15. The method of claim 11 wherein the visibility of object triangles or part thereof is tested by intersecting secondary rays with object triangles.
 16. The method of claim 11, further comprising clipping non-visible parts of object triangles.
 17. The method of claim 11 wherein light values are sampled at the intersection points between secondary rays and object triangles.
 18. The method of claim 11 wherein secondary rays are matched with primary rays by means of hit points created by primary rays on the non-planar specular surface.
 19. The method of claim 11 wherein light values contributed by secondary rays create a ray traced reflection layer.
 20. A non-transitory computer-readable medium comprising computer-readable instructions that, when executed by one or more processors, configure the one or more processors to perform operations comprising: for each triangle of a non-planar specular surface, generating rays that are first projected to that triangle and further, as secondary rays, to the objects' triangles; to each surface's triangle, transforming and shading all its visible objects' triangles or part thereof; per all the secondary rays of each triangle of the non-planar specular surface, sampling light values of the transformed and shaded objects' triangles; projecting primary rays on the non-planar specular surface; per each triangle of the non-planar specular surface, matching the secondary rays and the primary rays; compositing the light values of the matched primary and secondary rays; and transferring all the composited light values to their respective image pixels. 